Flagging clickbait in Indonesian online news websites using fine-tuned transformers
Authors
Abstract
Click counts are tied to the amount of money that online advertisers pay news sites. This business model has pushed some sites to employ a dirty trick, click-baiting, i.e., using hyperbolic and interesting words, and sometimes unfinished sentences, in a headline to purposefully tease readers. Some Indonesian online news sites have also joined the clickbait party, which indirectly degrades the credibility of other established sites. A neural network was trained to detect clickbait headlines: a pre-trained multilingual bidirectional encoder representations from transformers (BERT) language model acts as the embedding layer and is combined with a 100-node hidden layer topped with a sigmoid classifier. With a training dataset of 6,632 headlines in total, the model performed remarkably well. Evaluated with 5-fold cross-validation, it has an accuracy score of 0.914, an F1-score of [...], a precision of 0.916, and a receiver operating characteristic-area under curve (ROC-AUC) of 0.92. The use of BERT for this text classification task was tested and could possibly be enhanced further. Future possibilities, societal impact, and limitations of clickbait detection are discussed.
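The pipeline described in the abstract (an embedding fed to a 100-node hidden layer and a sigmoid output, evaluated with 5-fold cross-validation) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' implementation: the function names, the ReLU hidden activation, the random untrained weights, the 768-dimensional stand-in embedding, and the shuffled, non-stratified fold assignment are all assumptions made here. In the actual system the embedding would come from multilingual BERT.

```python
import math
import random

HIDDEN_NODES = 100  # from the abstract: a 100-node hidden layer


def sigmoid(z):
    """Logistic function used by the output classifier."""
    return 1.0 / (1.0 + math.exp(-z))


def init_head(embed_dim, seed=0):
    """Random, untrained weights for the head (illustration only)."""
    rng = random.Random(seed)
    w1 = [[rng.gauss(0, 0.02) for _ in range(embed_dim)]
          for _ in range(HIDDEN_NODES)]
    b1 = [0.0] * HIDDEN_NODES
    w2 = [rng.gauss(0, 0.02) for _ in range(HIDDEN_NODES)]
    b2 = 0.0
    return w1, b1, w2, b2


def head_forward(embedding, params):
    """Map one headline embedding to a clickbait probability.

    The ReLU activation in the hidden layer is an assumption; the
    abstract does not name the activation function.
    """
    w1, b1, w2, b2 = params
    hidden = [max(0.0, sum(w * x for w, x in zip(row, embedding)) + b)
              for row, b in zip(w1, b1)]
    return sigmoid(sum(w * h for w, h in zip(w2, hidden)) + b2)


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train, test) index lists for shuffled k-fold cross-validation."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        train = [j for f in range(k) if f != i for j in folds[f]]
        yield train, folds[i]


def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1 for 0/1 labels (1 = clickbait)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": (tp + tn) / len(pairs), "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Stand-in for a BERT sentence embedding (768 dimensions assumed).
    fake_embedding = [0.1] * 768
    print(head_forward(fake_embedding, init_head(768)))
    for train, test in k_fold_indices(6632, k=5):
        print(len(train), len(test))
```

In a real run, each fold would train the head on the `train` indices, threshold the sigmoid output (for example at 0.5) to obtain 0/1 predictions on the `test` indices, and average the per-fold results from `binary_metrics`.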
Similar resources
Clickbait detection using word embeddings
Clickbait is a pejorative term describing web content that is aimed at generating online advertising revenue, especially at the expense of quality or accuracy, relying on sensationalist headlines or eye-catching thumbnail pictures to attract click-throughs and to encourage forwarding of the material over online social networks. We use distributed word representations of the words in the title as...
Clickbait Identification using Neural Networks
This paper presents the results of our participation in the Clickbait Detection Challenge 2017. The system relies on a fusion of neural networks, incorporating different types of available information. It does not require any linguistic preprocessing, and hence generalizes more easily to new domains and languages. The final combined model achieves a mean squared error of 0.0428, an accuracy of...
Measuring Actual Visitor Engagement in News Websites
As revenues from Internet advertising continue to grow, advertisers seek popular news websites for placing advertisements in an effort to maximize profits. An important measure of how well a website is performing or how attractive it is to the advertisers is how engaged the web visitors are with that website. During our background study, we explored articles covering metrics to measure online u...
Statistics Hacking - Exploiting Vulnerabilities in News Websites
We analyze and discuss a vulnerability in leading news websites that can lead to modification of the system statistics by malicious users (statistics hacking). We outline two broad categories of methods that can counter statistics hacking. The first category consists of many methods already available to distinguish human users from computers. We compare the different methods within this catego...
Journal
Journal title: International Journal of Power Electronics and Drive Systems
Year: 2023
ISSN: 2722-2578, 2722-256X
DOI: https://doi.org/10.11591/ijece.v13i3.pp2921-2930